255 results found.
Language Type:
Trilingual
Languages:
Egyptian Arabic English Mandarin Chinese
Availability:
The Data Will Be Published Via LDC General Catalogue
License:
<Not Specified>
Size:
1936987 words Production Status:
Newly created-finished
Use:
Anaphora, Coreference
-
Paper title:Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 2 | Martha Palmer | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 3 | Nianwen Xue | Computer Science Department, Brandeis University | US | ||
| Author 4 | Lance Ramshaw | Raytheon BBN Technologies | US | ||
| Author 5 | Mohamed Maamouri | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 6 | Ann Bies | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 7 | Kathryn Conger | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 8 | Stephen Grimes | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 9 | Stephanie Strassel | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Main Contact | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | None |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
Czech English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
eight million tokens Production Status:
Newly created-finished
Use:
Semantic Role Labeling
-
Paper title:Towards Comparability of Linguistic Graph Banks for Semantic Parsing
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Stephan Oepen | Universitetet i Oslo | NO |
| Author 2 | Marco Kuhlmann | Linköping University | SE |
| Author 3 | Yusuke Miyao | National Instutite of Informatics | JP |
| Author 4 | Daniel Zeman | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 5 | Silvie Cinkova | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 6 | Dan Flickinger | Stanford | US |
| Author 7 | Jan Hajic | Charles University in Prague | CZ |
| Author 8 | Angelina Ivanova | University of Oslo | NO |
| Author 9 | Zdenka Uresova | Charles University in Prague | CZ |
| Main Contact | Stephan Oepen | Universitetet i Oslo | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Kazakh Mandarin Chinese
Availability:
From Owner
License:
ELRA
Size:
52, 478 entries Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Bilingual Dictionary Induction as an Optimization Problem
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Wushouer Mairidan | Department of Social Informatics, Kyoto University | JP |
| Author 2 | Toru Ishida | Kyoto University | JP |
| Author 3 | Donghui Lin | Department of Informatics, Kyoto University | JP |
| Author 4 | Katsutoshi Hirayama | Graduate School of Maritime Sciences, Kobe University | JP |
| Main Contact | Wushouer Mairidan | National Institute of Information and Communications Technology (NICT) | None |
Documentation:
(In process)
Written
Corpus,
Language Type:
Trilingual
Languages:
English Japanese Mandarin Chinese
Availability:
Freely Available
License:
http://lotus.kuee.kyoto-u.ac.jp/ASPEC/#agreement.html
Size:
3680000 sentences Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:ASPEC: Asian Scientific Paper Excerpt Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Toshiaki Nakazawa | Japan Science and Technology Agency | JP |
| Author 2 | Manabu Yaguchi | Japan Science and Technology Agency | JP |
| Author 3 | Kiyotaka Uchimoto | National Institute of Information and Communications Technology | JP |
| Author 4 | Masao Utiyama | National Institute of Information and Communications Technology | JP |
| Author 5 | Eiichiro Sumita | National Institute of Information and Communications Technology | JP |
| Author 6 | Sadao Kurohashi | Kyoto University | JP |
| Author 7 | Hitoshi Isahara | Toyohashi University of Technology | JP |
| Main Contact | Toshiaki Nakazawa | Japan Science and Technology Agency | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
10000 Production Status:
Newly created-finished
Use:
Chinese word segmentation, parsing, machine translation
-
Paper title:Parsing Chinese Synthetic Words with a Character-based Dependency Model
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Fei Cheng | Nara Institute of Science and Technology | JP |
| Author 2 | Kevin Duh | Nara Institute of Science and Technology | US |
| Author 3 | Yuji Matsumoto | Nara Institute of Science and Technology | JP |
| Main Contact | Fei Cheng | National Institute of Informatics | None |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
English Japanese Mandarin Chinese
Availability:
From Owner
License:
In Progress
Size:
25 Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Sakriani Sakti | Nara Institute of Science and Technology | JP | ||
| Author 2 | Keigo Kubo | Nara Institute of Science and Technology | JP | ||
| Author 3 | Sho Matsumiya | Nara Institute of Science and Technology | JP | ||
| Author 4 | Graham Neubig | Nara Institute of Science and Technology | US | ||
| Author 5 | Tomoki Toda | Nara Institute of Science and Technology | JP | ||
| Author 6 | Satoshi Nakamura | Nara Institute of Science and Technology | JP | ||
| Author 7 | Fumihiro Adachi | NEC Corporation | JP | ||
| Author 8 | Ryosuke Isotani | NEC Corporation | JP | ||
| Main Contact | Sakriani Sakti | Nara Institute of Science and Technology | None | Nara Institute of Science and Technology (NAIST) / RIKEN AIP | None |
Documentation:
Not yet
Written
Lexicon,
Language Type:
Multilingual
Languages:
Mandarin Chinese Uighur
Availability:
From Owner
License:
ELRA
Size:
52, 478 entries Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Bilingual Dictionary Induction as an Optimization Problem
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Wushouer Mairidan | Department of Social Informatics, Kyoto University | JP |
| Author 2 | Toru Ishida | Kyoto University | JP |
| Author 3 | Donghui Lin | Department of Informatics, Kyoto University | JP |
| Author 4 | Katsutoshi Hirayama | Graduate School of Maritime Sciences, Kobe University | JP |
| Main Contact | Wushouer Mairidan | National Institute of Information and Communications Technology (NICT) | None |
Documentation:
(In process)Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
4000 sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Building A Case-based Semantic English-Chinese Parallel Treebank
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | huaxing shi | Harbin Institute of Technology | CN |
| Author 2 | Tiejun Zhao | Harbin Institute of Technology | CN |
| Author 3 | Keh-Yih Su | Institute of Information Science, Academia Sinica | TW |
| Main Contact | huaxing shi | Harbin Institute of Technology | None |
Documentation:
<Not Specified>
Speech
Lexicon,
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
Creative Commons 1.0 Universal
Size:
184195793 KByte Production Status:
Newly created-finished
Use:
Psycholinguistic resource
-
Paper title:Database of Mandarin Neighborhood Statistics
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Karl Neergaard | The Hong Kong Polytechnic University | HK |
| Author 2 | Hongzhi Xu | The Hong Kong Polytechnic University | HK |
| Author 3 | Chu-Ren Huang | The Hong Kong Polytechnic Universiy | HK |
| Main Contact | Karl Neergaard | The Hong Kong Polytechnic University | None |
Documentation:
https://github.com/karlneergaard/Mandarin-Neighborhood-Statistics/blob/master/README.md
Written
Corpus,
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
Freely Available
License:
ADAPT Centre, Huawei, DCU
Size:
100000 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Automatic Construction of Discourse Corpora for Dialogue Translation
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Longyue Wang | ADAPT Centre, School of Computing, Dublin City University | IE | ||
| Author 2 | Xiaojun Zhang | CNGL Centre for Global Intelligent Content, Dublin City University | IE | ||
| Author 3 | Zhaopeng Tu | Huawei Noah's Ark Lab | HK | ||
| Author 4 | Andy Way | ADAPT, Dublin City University | IE | CNGL, Dublin City University | IE |
| Author 5 | Qun Liu | Dublin City University | IE | ||
| Main Contact | Longyue Wang | ADAPT Centre, School of Computing, Dublin City University | None |
Documentation:
NA




